590 research outputs found

    Space-efficient Feature Maps for String Alignment Kernels

    Get PDF
    String kernels are attractive data analysis tools for analyzing string data. Among them, alignment kernels are known for their high prediction accuracies in string classifications when tested in combination with SVM in various applications. However, alignment kernels have a crucial drawback in that they scale poorly due to their quadratic computation complexity in the number of input strings, which limits large-scale applications in practice. We address this need by presenting the first approximation for string alignment kernels, which we call space-efficient feature maps for edit distance with moves (SFMEDM), by leveraging a metric embedding named edit sensitive parsing (ESP) and feature maps (FMs) of random Fourier features (RFFs) for large-scale string analyses. The original FMs for RFFs consume a huge amount of memory proportional to the dimension d of input vectors and the dimension D of output vectors, which prohibits its large-scale applications. We present novel space-efficient feature maps (SFMs) of RFFs for a space reduction from O(dD) of the original FMs to O(d) of SFMs with a theoretical guarantee with respect to concentration bounds. We experimentally test SFMEDM on its ability to learn SVM for large-scale string classifications with various massive string data, and we demonstrate the superior performance of SFMEDM with respect to prediction accuracy, scalability and computation efficiency.Comment: Full version for ICDM'19 pape

    Role of the PAS sensor domains in the Bacillus subtilis sporulation kinase KinA

    Get PDF
    Histidine kinases are sophisticated molecular sensors that are used by bacteria to detect and respond to a multitude of environmental signals. KinA is the major histidine kinase required for initiation of sporulation upon nutrient deprivation in Bacillus subtilis. KinA has a large N-terminal region (residues 1 to 382) that is uniquely composed of three tandem Per-ARNT-Sim (PAS) domains that have been proposed to constitute a sensor module. To further enhance our understanding of this "sensor" region, we defined the boundaries that give rise to the minimal autonomously folded PAS domains and analyzed their homo- and heteroassociation properties using analytical ultracentrifugation, nuclear magnetic resonance (NMR) spectroscopy, and multiangle laser light scattering. We show that PAS(A) self-associates very weakly, while PAS(C) is primarily a monomer. In contrast, PAS(B) forms a stable dimer (K-d [dissociation constant] o

    An interior penalty method for a finite-dimensional linear complementarity problem in financial engineering

    Get PDF
    In this work we study an interior penalty method for a finite-dimensional large-scale linear complementarity problem (LCP) arising often from the discretization of stochastic optimal problems in financial engineering. In this approach, we approximate the LCP by a nonlinear algebraic equation containing a penalty term linked to the logarithmic barrier function for constrained optimization problems. We show that the penalty equation has a solution and establish a convergence theory for the approximate solutions. A smooth Newton method is proposed for solving the penalty equation and properties of the Jacobian matrix in the Newton method have been investigated. Numerical experimental results using three non-trivial test examples are presented to demonstrate the rates of convergence, efficiency and usefulness of the method for solving practical problems

    A strategy to obtain axenic cultures of Arthrospira spp. cyanobacteria

    Get PDF
    A strategy to obtain axenic cultures of the cyanobacterium Arthrospira sp. (‘platensis’) Lefevre 1963/M-132-1 strain, consisting of a series of physical and chemical procedures, and the application of an optimized pool of antibiotics, is described in this paper. This strategy, which is an inexpensive and fast way to obtain axenic cultures, can be applied to Arthrospira spp. from culture collections or samples from their natural habitats to eliminate a wide spectrum of contaminants. A high alkaline treatment (pH 12, using KOH) of 72 h is a determinant initial procedure applied to eliminate protozoa and Microcystis sp. Bacteria were eliminated by an optimal antibiotic pool treatment, and Chroococcus sp. residuals were discarded by serial dilution. Optimal concentrations of the antibiotics composing the pool were obtained by a 24 factorial central composite rotatable design (CCRD) and Response Surface Methodology (RSM), resulting in: ampicillin 61.6 μg/ml, penicillin 85.8 μg/ml, cefoxitin 76.9 μg/ml, and meropenem 38.9 μg/ml. The results also indicate that cefoxitin was the most effective antibiotic of this pool. After obtaining the axenic culture, identification of Lefevre 1963/M-132-1 strain was performed using amplification and sequencing of the ITS region (including part of 16S rRNA, tRNA Ile, ITS, tRNA Ala and part of 23S rRNA region) and fatty acid composition data. Data base comparison revealed that Lefevre strain is closely related to A. platensis species (99% identity), while fatty acid composition data suggested A. maxima. These seemingly contradictory results are discussed

    Multilevel Selection in Models of Prebiotic Evolution II: A Direct Comparison of Compartmentalization and Spatial Self-Organization

    Get PDF
    Multilevel selection has been indicated as an essential factor for the evolution of complexity in interacting RNA-like replicator systems. There are two types of multilevel selection mechanisms: implicit and explicit. For implicit multilevel selection, spatial self-organization of replicator populations has been suggested, which leads to higher level selection among emergent mesoscopic spatial patterns (traveling waves). For explicit multilevel selection, compartmentalization of replicators by vesicles has been suggested, which leads to higher level evolutionary dynamics among explicitly imposed mesoscopic entities (protocells). Historically, these mechanisms have been given separate consideration for the interests on its own. Here, we make a direct comparison between spatial self-organization and compartmentalization in simulated RNA-like replicator systems. Firstly, we show that both mechanisms achieve the macroscopic stability of a replicator system through the evolutionary dynamics on mesoscopic entities that counteract that of microscopic entities. Secondly, we show that a striking difference exists between the two mechanisms regarding their possible influence on the long-term evolutionary dynamics, which happens under an emergent trade-off situation arising from the multilevel selection. The difference is explained in terms of the difference in the stability between self-organized mesoscopic entities and externally imposed mesoscopic entities. Thirdly, we show that a sharp transition happens in the long-term evolutionary dynamics of the compartmentalized system as a function of replicator mutation rate. Fourthly, the results imply that spatial self-organization can allow the evolution of stable folding in parasitic replicators without any specific functionality in the folding itself. Finally, the results are discussed in relation to the experimental synthesis of chemical Darwinian systems and to the multilevel selection theory of evolutionary biology in general. To conclude, novel evolutionary directions can emerge through interactions between the evolutionary dynamics on multiple levels of organization. Different multilevel selection mechanisms can produce a difference in the long-term evolutionary trend of identical microscopic entities
    corecore